arXiv: 1505.01165vl [math.PR] 5 May 2015 


A mixing tree-valued process arising under neutral 
evolution with recombination 

Andrej Depperschmidt, Etienne Pardoux, Peter Pfaffelhuber 

May 7, 2015 


Abstract 

The genealogy at a single locus of a constant size N population in equi¬ 
librium is given by the well-known Kingman’s coalescent. When considering 
multiple loci under recombination, the ancestral recombination graph encodes 
the genealogies at all loci in one graph. For a continuous genome (E, we study 
the tree-valued process of genealogies along the genome in the limit 

N —>■ oo. Encoding trees as metric measure spaces, we show convergence to a 
tree-valued process with cadlag paths. In addition, we study mixing properties 
of the resulting process for loci which are far apart. 


1 Introduction 


A large body of literature within the area of mathematical population genetics is 
dealing with models for population of constant size. While finite models such as 
the Wright-Fisher or the Moran model all have their specificities, the limit of large 
populations - given some moments are bounded - leads to a unified frame work with 
diffusions and genealogical trees as their main tools; see e.g. EwensI ( 2004). In finite 
population models of size N - frequently denoted Cannings models ( Canning. 1974ll 
— the offspring distribution of all individuals in each generation is exchangeable and 
subject to the constraint of a constant population size. 

Neutral evolution accounts for the fact that all individuals have the same chance 
to produce offspring in next generations. Recombination is the evolutionary force by 
which genetic material from more than one (i.e. two in all biologically relevant cases) 
parents is mixed in their offspring. Genealogies under neutral ev olution without re¬ 
combination are given through the famous Kingman coalescent (jKingmanl . Il982l l , a 
random binary tree where pairs of lines merge exchangeably in a Markovian fash¬ 
ion. Genealogies under recombination must deal with the fact that recombination 
events mix up genetical material from the parents. As a consequence, lines not only 
merge due to joint ancestry, but also split due to different ancestors for the genetic 
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material along the genome. The resulting genealogy is encoded i n the Ancestral Re¬ 
combination Graph (ARG), which appeared already inlHudsonl (1198311. but entere d 
the mathematical literature only in Griffith^ ( 1991 1: Griffiths and Marjoram (Il997l l. 
This graph gives the genealogies of all genetic loci under stationarity at once; see 
also Figure [H 

The sequence of genealogies along the chromosome is most important for biolog¬ 
ical applications, and fast simulation and infe rence of such genealogies is a major 
research topic today ( Rasmussen et ah . 2r)14l b While the ARG gives the sequence 
of genealogies from the present to the past, a construction of genealogies along th e 
chromosome is possible as well ( Wiuf and Heinl. 19991 : Leocard and Pardoux . 201fll b 
The advantage of the latter approach is that it allows to approximate the full se- 
quence by ignoring l ong-r ange dependencies, a fruitful research topic started by 
McVean and Gardin ( 2005l j. 


The goal of the present paper is to study the sequence of genealogies along the 
genome, denoted G, in the limit N —>■ oo. Precisely, we will use the not ion of (ultra-) 


metric measure spaces, introduced in the probabilistic community bv iGreven et al 


([20091), in order to formalize genealogical trees, read off the sequence from 

the ARG and let Ai —>■ oo. As main results, we obtain convergence (Theorem [1]) to an 
ergodic tree-valued process which has cadlag paths and study its mixing properties 
(Theorem [21). We start by introducing our notation. 


Remark 1.1 (Notation). Let {E,r) be a metric space. We denote by A4i{E) the 
space of all probability measures on E equipped with the Borel-ci-algebra B{E). 
The space Cb{E) consists of all continuous, bounded, real-valued functions defined 
on E. For a second metric space {F,rp) and fa € M.i[E) and a measurable map 
ip : E ^ F, the measure is the push-forward of p under p. We denote vectors 
{xi,X2, ...) € E^ by X and integrals will be frequently denoted by {v, f) := f fdv. 
Weak convergence of probability measures will be denoted by =i>. For ICR, the 
space De{I) is the set of cadlag functions f : I ^ E. 


2 Ancestry under recombination 

For a set of loci G, also called genome in the sequel, we aim to study the ancestry of 
individuals from a large population. The joint genealogy for all loci is given by the 
ancestral recombination graph (Section 12.1|) . from which we can read off genealogical 
trees at all loci u £ G (Section (221). In Section (221 we formalize random genealogies 
as metric measure spaces. 


2.1 The ancestral recombination graph 


In this section we give a formal definition of the ancest ral recombination graph (ARG) 

m ( 


which is a (slight) generalization of the definition from iGriffiths and Mariora: 
see also the leftmost branching and coalescing graph in Figured! 


(119971 1: 


Definition 2.1 (N-ancestral recombination graph). 

1. For a < 6, G := [a, 5], p > 0 and a finite set [A^] :={!,..., N}, the N-ancestral 
recombination graph (ARG), denoted by A := .4,'^ := 4.'^(G), starting with 
particles in the set [A^] is defined by the following Markovian dynamics: 
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A:= 


Tu{A) 

U<Ul,U2 


%,(A) 

U\<U<U2 


TRA) 

Ul,U2<U 


Figure 1: An example of an ARG A is given for N = 5 particles. From the resulting 
graph, where splitting events are marked by ui and U 2 (with ui < U 2 ), trees can 
be read off by using right and left particle at these splitting events. This results in 
three (realizations of) 5-Kingman coalescents. 


(i) When there are k > 2 particles, two randomly chosen particles coalesce 
(merge) at rate ( 2 ) and give rise to a single new particle. 

(ii) Each particle splits in two at rate p(b — a), resulting in a new left and a 
new right particle. Such a splitting event is marked by an independent, 
uniformly distributed random variable 17 E G. 


Denote by At the set of particles at time t > 0 and stop when there is only 
one particle left. 


2. The particle-counting process Af = (W)t>o with Nt = if At for is a birth- 
and death chain. Precisely, Af has birth rate = p{b — a)k and death rate 
dfc = ( 2 ), A: = 1, 2,... and is stopped at T = inf{t : Nt = 1}. 


Since the birth rates are linear and the death rat es are quadratic, the expecta tion 
of the stopping time T is finite; see Theorem 2.1 in IPardoux and Salamatl ( 
an explicit expression. 


2nn9l i for 


Remark 2.2 (Interpretation). Clearly, within the above definition, some biological 
interpretation can be given. 

1. The set G is the genome, i.e. the set of all loci (for any individual within the 
population). An element u E G is called a locus. 

2. The parameter p is the recombination coefficient per unit length. 

3. The set [A^] represents N individuals sampled from a population and the par¬ 
ticles within At form the ancestry of the individuals from Aq at time t in the 
past. 

4. A coalescence event within A indicates joint ancestry. 
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5. Instead of talking about left and right particles to follow within the ARG, the 
biological language would rather suggest to talk about upstream and down¬ 
stream genomic sequences. 

Remark 2.3 (ARG as a limiting graph, single crossovers). 

1. The ARG arises as a limiting object within finite Moran models of population 
size N as, N ^ oo. In the model with recombination a finite population of 
N individuals, each carrying a genetic material indexed by G, undergoes the 
following dynamics: 

(a) Every (unordered) pair of individuals resamples at rate 1, that is, one 
individuals dies and is replaced by an offspring of the other individual. 
The offspring carries the same genetic material as the parent. 

(b) At rate p/N, every (unordered) pair {i,j} of individuals resamples with 
recombination, that is, a resampling event occurs, individual j dies, say, 
a third individual r and a random U, distributed uniformly on G, is 
chosen. Then, j is exchanged by an individual, which carries genetic 
material [a,U) from I and \U, b] from r. 

When considering the history of a sample of iV <C A" individuals, two things 
can happen: First, if a resampling event of two individuals within the sample 
is hit, these individuals find a common ancestor, and their ancestral lines 
coalesces. Second, if a line hits a resampling event with recombination, the 
history of its genetic material is split at the corresponding U, and follows along 
two different lines. (Note that this happens at rate {N — l)p/N ~ p.) These 
two lines have a high chance to be outside the sample of N lines T N N. 
As we see, as A" —>■ oo, the ancestry is properly described by the ARG as in 
Definition 12.11 

2. We assume here only single crossovers, i.e. the mix of the genetic material of i 
and r is exactly as just described (rather than taking e.g. [a, Ui] U (C/ 2 , b] from 
i and (C/i, C/ 2 ] from r for some random variables a < C/i < C /2 < b.) 

2.2 Trees derived from an ARG 

In this section we describe sets of trees that can be read off from an ARG and 
discuss some of their properties as well as different constructions of the ARG. A 
construction of the ARG along the genome (see Remark 14.311 will be particularly 
useful in the sequel and will be explained in more detail in Section 14.11 

Definition 2.4 (Genealogical trees read off from A). Let A = {At)t>o be an ances¬ 
tral recombination graph and B C [N] be a subset of the initial particles. For u € G 
we read off the random tree Tu ■= 7)f := T^iA) (in the case B = [A], we also write 
Tu := := 7/(A)) as follows: 

(i) Start with particles in the set B and follow particles along A. 

(ii) Upon coalescence events within A, followed particles are merged as well. If a 
coalescence event within A only involves a single followed particle, continue to 
follow the coalesced particle. 
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(iii) Upon a splitting event, consider its mark U. If n < U, follow the left particle, 
and T u > U, follow the right particle. 

We denote by Tu,t ■= Tut ■— Tuti-T) the set of particles in Tu at time t. Furthermore 
we denote the root of T hy •u- 

Remark 2.5 ((W)coalescent). We will frequently use the notion of an N (-Kingman)- 
coalescent. This is a random tree arising by the following particle picture: Starting 
with N particles, each pair of particles coalesces exchangeably at rate 1. (Alter¬ 
natively, we may say that the total coalescence rate when there are k particles is 
( 2 ) and upon a coalescence event, a random pair is chosen to coalesce.) The tree is 
stopped when reaching a single particle which we denote by • in the sequel. It is 
well-known (see also Example 12.1311 that this random tree converges as —>■ 00 to 
the Kingman’s coalescent. 

Remark 2.6 (Properties of Tu{A)). 

1. Since A is stopped upon reaching a single particle, and no splits occur within 
Tu^ the latter is certainly a tree. In particular, its root may or may not be 
identical to the node of the stopping particle within A. 

2. Note that for M = each tree 7(f is an M-coalescent. Indeed, by ex¬ 
changeability within Kingman’s coalescent, any two particles within this tree 
coalesce at rate 1, independently of all others. 


Remark 2.7 (Unused branches of ARG). If R is the number of recombination events 
in an ARG A, then we can bound the number of different trees in {T^{A))u&g- 
When following the left and right branches at each recombination point, we find 2^ 
different trees. However, since the R recombination points have marks t/i,... , Ur, 
there are at most R -|- 1 different trees arising from u < U(i), [/(j) < u < = 

1,..., R — 1 and u > U(^r-j (where [/(j) is the Rh order statistic of Ui,..., Ur). How¬ 
ever, it is possible that a branch within A which is only followed when considering 
u € iv,b] (for some r; G (G) carries a recombination event with mark U < v. In this 
case, TJI = TJ^ which reduces the number of different trees. For a lower bound of 
the number of different trees with R recombination events in A, we find a minimum 
of two different trees within (T)^(A))ueG if R > 0. 

This somewhat inefficient procedure of generating recombination events which 
do not take effect on t he level of trees has the adva ntage of mathematical clarity 
and has been used bv [Griffiths and MarioramI (jl997l b It is also possible to allow 
onl y recombinatio n events which are used when reading off the trees (7)f^(A))ueG', 
Hudson ( 1983l b The latter procedure has the advantage of being more efficient 


see 


in simulations. 


Remark 2.8 (Gonstruction of {Tu)u&g along the genome). Instead of constructing 
the pr ocess (7))^)tjeG from the present to the past along the ARG A, IWiuf and Hein 
( 1999l l have shown that there is also a construction along the genome. We will recall 
this approach together with approximations of {Tu)u&g related to this construction 
in Section sn 


Remark 2.9 (Outlook on Theorem [T]). Before we go on with introducing more 
objects needed to formulate our main results let us give an outlook on one of them. 
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1 2 3 4 5 6 

Figure 2: Two (interacting) trees with three leaves each read off from a joint ARG 
starting with disjoint sets of leaves. The black tree (at locus u = 0) is {A^), 

while the gray tree (at locus u = v) is {A^). 

1. Our goal is to study 

convergence of the process {T^)u&g as A" —>■ oo. (2.1) 

Since is an A-coalescent for all tt G G, and as the convergence of the 
A-coalescent to Kingman’s coalescent as A —oo is well-known, convergence 
of finite-dimensional distributions in (12.1|) is not surprising. However, we will 
also show tightness of {{Tj^)ueG ■ Af G N} in the space of cadlag paths. This 
requires to define a proper topology on the space of trees, which we will do in 
the next section. 

2. In our formulation of Theorem [H two different sets of trees derived from an 
ARG will arise: 

(a) For A^, we will consider 

{T^^iA^) :u€G}, 

which is the set of all trees with A leaves from an A-ARG. 

(b) For A^ and ni,... ,nj G N with ni = n, consider a partition 

{Hi ,... ,Bj} of [n] with = ni,..., = rij and ui,...,Uj G G. 

Then, we consider the trees 

These trees arise when considering an n-ARG and partition its initial 
particles into the sets Bi,... ,Bj and following their ancestry. See Figure[2] 
for an example of resulting trees and their interaction with each other. 
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2.3 Space of metric measure spaces 


Any (finite or infinite) genealogical tree can be encoded by an (ultra-) metric space 
(A, r) where X is the set of leaves and r is the genealogical distance. If we equip 
the set of leaves with a sampling probability measure we obtain a metric measure 
space (A, r, p ). Spaces of metri c measur e spac es wer e introduced and the ir topolog¬ 
ical properties were studied in Gromov! ( 2007 ^ and Greven et al. ( 2009l b We now 
recall the space of (isometry classes of) metric measure spaces M, the Gromov-weak 
topology on M, and polynomials, which form a convergence determining algebra of 
functions 


Definition 2.10 (mm-spaces). 

1. A metric measure space (mm-space) is a triple (A, r,/r) where (A, r) is a com¬ 
plete and separable metric space and p, € AIi(A) with supp(^) = A. Two 
mm-spaces (Ai, ri, /ii) and (A 2 , r 2 , ^^ 2 ) are called measure-preserving isometric 
if there exists an isometry Lp between Ai and A 2 so that /U 2 = 

2. Being measure preserving isometric is an equivalence relation and we denote 
the equivalence class of (A, r, p) by (A, r, p) and write 

M := {ac = {X,r,ia) : (A, r,/r) is a mm-space} (2-2) 

for the space of measure preserving isometry classes of mm-spaces. 

In order to define a topology on M, we use the notion of polynomials. 

Definition 2.11 (Distance matrix distribution, polynomials and the Gromov-weak 
topology). Let (A, r, ;u) be a mm-space and ?c = {X,r,ij.). 

(N\ 

1. We define the function R := : A^ ^ by R{x) = {r{xi,Xj))i<i^j 

and define the distance matrix distribution 

e Mi(wi^py 


2 . 


A function : M —)• IR is called polynomial if there is a bounded measurable 


function cj) : IR 
that for all ?c = 


—>• IR, depending only on finitely many coordinates, such 
: (A, r^p) € M we have 


^K) = {t>!^^AoR) = {v\4>). (2.3) 


The smallest n for which there is a function f which depends only on coordi¬ 
nates {rij)i<i<j<n so that (|2.3p holds is called the degree of the polynomial <I>. 
We then write instead of to stress the dependence on n and f. The 
space of bounded continuous polynomials is denoted 

n° := : n € N, (/. € Cb(M^2^)}. (2.4) 

(Sometimes we will abuse notation and write (t>{{rij)i<i<^j<n) '■= ■) 

^The first edition of this book appeared in 1999. An even older french version from 1981 did not 
contain the chapter about metric measure spaces. 
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3. The smallest topology on M for which all functions II^ are continuous is 

71 —^OO • p 1 1 . p 

—>■ ar if and only if 

n^oo j. 


called the Gromov-weak topology. For this topology, 

y!fn 

Remark 2.12 (Some properties of polynomials). 


1. We stress that for = {X,rfj,), the measure does not depend on the 

representative and hence is well-defined. 

2. Given two polynomials and one can show that the product • 


is a polynomial of degree n -|- m; see Remark 2.8(i) in iGreven et al 


m 


( 2013lh The spac e 11° is an algebra whic h separates points; s ee Section 3i.5. 


Gromov (2007) and Proposition 2.6 in Greven et al. ( 2009l h 


3. 


The space M equipped with the Gromov-weak topology is Polish. Later in 
Section [2.41 we will give the Gromov-Prohorov metric on M, which is complete 
an d metrizes th e Grom ov-weak topology (see Theorem 5 and Proposition 5.6 


m iGreven et al.l (j2009l lL We will also give the Gromov-Hausdorff metric and 
other metrics on M, which we will need in the formulation and proof of The¬ 
orem [TJ 

Example 2.13 (Kingman coalescent tree as a metric measure space). Consider the 
A^-coalescent from Remark 12.51 Let = [N] = {1,... , A^} be the set of leaves 
and let the metric be the usual tree distance, i.e. r^{i,j) is twice the time to 
the MRCA of i and j, 1 < i,j Si N. Finally, let be the uniform measure on 
. Then •= (K^, u ^) is an equivalence class of a metric measure space. 


Furthermore, by Theorem 4 in Greven et al. ( 2009l i there exist an M-valued random 
variable such that 


(2.5) 

The limiting object ‘}(°° is called the Kingman measure tree. 

2.4 Metrics on metric (measure) spaces 

We now recall the definitions of several distances on M that we will use in the sequel. 
While the Hausdorff distance is a metric on closed subsets of a metric space, the 
Prohorov and total variation distances are metrics on probability measures on a 
metric space. All three distances can be turned into distances on M. 

Definition 2.14 (Hausdorff, Prohorov and total variation distances). Let {Z,d) be 
a metric space and /ri,/i 2 € A4i(Z). 

1. The Hausdorff distanee between two subsets A, B C Z is defined by 

dn{A, B) := inf{e >0:ACBfBC A^}. (2.6) 


where for A C Z 


A^ := {x £ Z : d{x, A) < e} = {z £ Z : 3y £ A, d{y, z) < e}. 
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2. The Prohorov distance of ^ 1,^2 is defined by 

(ip(/Ui,/U 2 ) := inf{e > 0 : ^i(-F) < +£,VF C Z, closed}. (2.7) 

3. The total variation distance of ^ 1,^2 is defined by 

dTv{hl,h2)-= sup - ;U2(^)|- (2.8) 

A£B(Z) 

Remark 2.15 (Total variation distance). 

1. If Z is finite, the total variation distance of probability measures /Ui and ^2 on 
Z is given by 

dTv(m,I^ 2 ) = \hi{{z}) - h2{{z})\. (2.9) 

z£Z 

2. Recall that the Prohorov distance of two probability measures is bounded by 
their total variation distance. 

For all three notions just defined, we now recall the corresponding “Gromov-versions” 
which are (semi-) metrics on M. The idea is always to find an optimal isometric 
embedding into a third metric space and compute there the usual distance of the 
images of the spaces and measures. 

Definition 2.16 (Gromov distances). Let = {Xi,ri, ni) and ?(2 = {X 2 ,r 2 , 1 ^ 2 ) be 
mm-spaces. Moreover, let ipi : Xi —)■ Z and tp2 ■ X2 —)■ Z be isometric embeddings 
into a common complete and separable metric space {Z,d). 

1. The Gromov-Hausdorff distance of and ?C 2 is defined by 

dGH(yi, t! 2 ) := inf dn{^pl{Xl), ^p 2 {X 2 )). 

(pi,(p2,Z 

2. The Gromov-Prohorov metric of and is defined by 

dGp(LL, 12 ):= inf d^{{ipi)^^i,{ip 2 )*ti 2 )- 

ipi,ip2,Z 

3. The Gromov total variation distance of and ?(2 is defined by 

dGTv(^,^)— inf dTv((7’i)*Iii, (</32)*^2)- 

<pi,^P2,Z 

Remark 2.17 (Properties). Let us recall or rather state some known and some 
obvious properties of the distances just introduced. 

1. The distances ^gh 1 ^gp ^ dcTV are well-defined. As can be seen by considering 
isometries between elements of one isometry class the distances do not depend 
on the representative. 

2. Since the Gromov-Hausdorff distance only uses the metric spaces (Xi,ri) and 
(AI 2 T 2 ) but not the measures /ii,// 2 ) it is only a pseudo-metric on M. 
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3. 


According to Lemma 5.4 and Proposition 5.6 in Greven et al. (20^) the Gromov- 
Prohorov metric dcp is indeed a metric on M and the metric space (M, dcp) 
is complete and separable. Moreover , it metrizes the Gromov-weak topology 


by Theorem 5 of Greven et al. ( 20091 ') 


4. Since the Prohorov distance is bounded by the total variation distance, we also 
find that 


C^GP('ll, ^) < dGTv(LL, ^)- (2-10) 

This will be useful later since the total variation distance is usually easier to 
compute or estimate than the Prohorov distance. 


3 Main results 

We now formalize the trees : u G G} as mm-spaces. Our results, then, are 

dealing with these M-valued random processes. In particular, Theorem [1] studies 
convergence as A" —>■ oo. Since is an A-coalescent for all u G G, the 

resulting process is stationary. It is even mixing, as Theorem [2] shows. 

Definition 3.1 (7^ as an mm-space). Let A = for A G N and G = [a,b] 

with a < 6 be an A-ARG. For u G G and B C [A], let T^iA) be as in Definition 12.41 
As in Example 12.131 let r be the usual tree-distance and /r the uniform measure on 
B. Then, (B, r, is a metric measure space. Its isometry class will be denoted 
% := ;= T^{A) in the sequel. If = [A] we write % := := %^{A). 

In Theorem [H we need the notion of the variation of a function which we briefly 
recall. 

Remark 3.2 (Variation). Let {E,r) be a metric space and f : I ^ E for / C R. 
The variation of f with respect to r on sub intervals [a, 6] C / is defined by 

k 

^aif) ■= supj^ r{f{ti), f{ti-i)) : k eN, a = to <ti < ■■■ <tk = by (3.1) 

i=l 

Theorem 1 (Convergence of A-ARGs). Let := {A))u^g be as in Defini- 

tion 1,9.11 Then, T on Dm(G) for some process T. The (law of the) 

process T = {‘Tu)ueG is uniquely given as follows: 

For each j G N, ui,...,Uj G G, ni ,... ,nj G N, let he as in Remark \2.y\ and R. 
be the distances of leaves Bi within ■ Then, for <l>j = <1)"'»’'^» G 11°, i = 1,... ,j, 

E[ci)i(a;j... = e[ME,) • • • (3-2) 

The paths ofT are almost surely of finite variation with respect to Gromov-Prohorov, 
Gromov total variation and Gromov-Hausdorff metrics. 

Remark 3.3 (Path-properties of T). We can ask about path-properties of the lim¬ 
iting process “T. Let us give an example: Let A^ be the number of e-balls that are 
needed to cover In other words A^ is the number of families in % some distance 
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£ from the leaves, 
hence we have eN^ 


For fixed n € G, we know that % is a Kingman coalescent and 
— 2 almost surely. (See e.g. (35) in Aldous ( 19991 '). 1 Is it also 


true that 


P(eA'^ ^ 2 for all w € G) = 1? 


Similar questions arise for well-known almost s ure properties of Kingman’s coalescent 
regarding the family-sizes near the leaves; see AldousI ( 1999l l. Chapter 4.2. 


For our next result, we need to extend the ARC to G = (— 00 , 00 ). This can be 
done using some projective property (see also Lemma [4.lip : Let Gn t (— 00 , 00 ). 
Clearly, (13.2p gives the finite-dimensional distributions of {‘Tu)u£G„ for every n. In 
particular, for m < n, we see that the projection of {‘Tu)u<=Gn to {u € Gm} is 
the same as {‘Tu)ueGm therefore (|3.2I) defines a projective family of probability 
measures which can by Kolmogorov’s extension Theorem be extended to the law of 

(‘lu)ttG]R- 

Theorem 2 (Mixing properties). For n € let and be poly¬ 

nomials of (at most) degree n and {'Tu)ueG be the extension of the limit process T 
from Theorem[J\ to G = IR. Then, there exists a positive finite constant C = C{n) 
such that 

|E[4'(To)$(Tj] - E[4 '(To)]E[$(' 4)]| < ^||^||^||<),||^, n > 0. (3.3) 

Remark 3.4 (Dependency on n ). In our proof we will show that (13.31) holds with 

o 4 

C{n) = ■ Therefore, the bound is only useful in the limit u —>■ 00 for fixed 

n. If n —)• 00 and u is fixed, the trivial bound |E['k(‘2o)<I)(‘21i)] — E['k(‘2o)]E[<h('Z^)]| < 
2||V’l|oo||</'||oo holds as well. It would be desirable to obtain a bound similar to (|3.3I) 
uniformly in n and u, but this seems to be out of reach with the techniques we 
develop here. 


4 Preliminaries 


4.1 Construction of {%p)ueG along the genome 

In Wiuf and HeinI ( I999I '). a construction of an A^-ARG was given, which results in 
the same trees {7'J)^)ueG (or {%f^)u^G) in distribution as described in Definition 12.41 
(and Definition 13.ip . but constructs this tree-valued process along the genome, i.e. 
by starting at tt E G and then letting the trees evolve when moving along G. This 
construction which w e recal l below will be helpful in all further proofs. See also 
Leocard and Pardoux ( 2OI0l i. 


Definition 4.1 (A^-ARG’ along the genome) 


1. For a < 6, G = [a,h] and A^ E N, construct an evolving pair {Q,T) = 
{Qn,'Tn)n=o, 1 , 2 ,..., where Qn is a graph and Tn is a tree as follows: 

2. Start with an A^-coalescent Go = To with set of leaves [A^] (which is continued 
indefinitely, i.e. not stopped upon hitting a single line) and in each step do the 
following (with Uo = a): 


































4 PRELIMINARIES 


12 


(a) Measure the length of the (vertical branches of the) graph L„ = L{Qn), 
choose a uniformly distributed point E Qn and an independent expo¬ 
nential random variable ^n- 

(b) From change the graph as follows: Create a split event at and 
mark it by Un = Un-i + ^n/{LnP)- The line in Qn which starts at Xn is 
called the left branch and a new right branch is created. This new branch 
coalesces with all other branches in Qn at rate 1. The resulting graph is 
Qn+i and Tn+i is given by following Tn until the right branch at Xn is hit 
and waiting until this branch coalesces back with 7^. (There is a chance 
that Xn ^ 7^; in this case we have 7^+i = Tn ) 

Stop when Un > b and set Q = Qn- 


3. Let us now consider the final graph Q. This is a coalescing-splitting random 
graph (similar to A in Definition 12.ip and therefore can also be considered as 
an evolving set of particles which coalesce and split, where splitting events are 
marked by some element of G and are continued by a left and right branch. 
Hence, we can define for a subset B C [A^] and u € G the random tree 
Su ■= Sn '■= Bn(Q) as in Definition 12.41 Again, we set ■= 

Remark 4.2 (Properties of (<S;f°)„e(G). 

1. From Wiuf and HeinI ( 1999I L the graphs A and Q have the same distribution. 


2 . 


hence the same is true for the processes (7(y )ug<g and (Sn)u€G- 

Let C C be a line of length I starting in some leaf x E [A^] and reaching 
in the direction of the root. Then, C C for some re E G if and only if no 
recombination marks C before reaching w. By construction, this happens with 
probability Moreover, let u < v < w. Then, (given S^), C C 

with probability Q-p\r-u\(.^ independent of the event C C . 

3. By construction, Q from above is an ARG and therefore, its length has finite 
expectation. Hence, the construction above terminates almost surely. 

Remark 4.3 (Approximating the ARG usi ng a Markov process a long the genome). 
One striking feature of the construction of Wiuf and Hein ( 1999l i is that it allows 
to approximate {Tn)u&G by a Markov process by changing the dynamics of the 
construction {Qn,Tn) in order to obtain a Markovian dynamics T = {Tn)'- 


1. The following approximation was used by MeVean and CardinI ( 2005I L Instead 
of (a) in Definition STI measure the length of the (vertical branches of the) tree 
Ln = L{Tn), choose a uniformly distributed point Xn E Tn and an independent 
exponential random variable ^n- Then, instead of (b), change the graph as fol¬ 
lows: Greate a split event at Xn, and mark it by Un = Un-i+fn/{LnP)- Delete 
the branch which connects Xn to its ancestral node from Tn, all other lines 
are available for coalescence. Then, start a new branch in Xn which coalesces 
with all other available branches rate 1. The resulting tree is Tn+i- In par¬ 
ticular, by only allowing coalescences with T^, this approximation becomes 
a Markov process, also called the Sequentially Markov Coalescent (SMC). (In 
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Figure 3: A conditional embedding of the trees and into a common tree. 


SMC’, Chen et al. ( 2009l l use almost the same construction but without delet¬ 
ing the branch connecting Xn to its ancestral node for the set of lines available 
for coalescence, leading to a better approximation.) 

2. Another simul ation softwa r e bas ed on the Markovian Coalescent Simulator 
(MaCS) from Chen et al. ( 20091 1. has all lines from the last k genealogical 
trees as available for coalescence after a recombination event. 


While approximations such as SMC and MaCS make genome-wide computer sim¬ 
ulations under recombination feasible, their construction differs from the ARC at 
least for loci which are far apart. In particular, Theorem [2] would not be true for 
these approximations. 


4.2 Conditional distances of trees and first upper bounds 

To obtain useful bounds on expected tree distances introduced in the previous section 
we will condition on one of the trees and use the construction of the tree-valued 
process along the genome from Section [4. II Let us start with an illustrative example. 

Example 4.4 (A conditional embedding of two trees and upper bounds; Figure [3]). 
Given the tree drawn in solid lines on the left of Figure El we can read off the 
times S 2 , ■ ■ ■, S 5 which the tree spends with exactly 2,... ,5 lines. These random 
variables are independent and each S/. is exponentially distributed with mean 1 /( 2 )- 
The black bullet indicates a recombination event with a mark between u and v, and 
5^ can be read off by following the dashed line. 

On the right hand side of Figure [3] the trees (black) and 5^ (gray) are em¬ 
bedded into a common tree. The embedding of into the common tree is given 
by Xj Xi,i = 1, 2,3 and Xj 1 -^ x',i = 4, 5. Using (|2.10p and (14.ip below it is easily 
seen that if the trees and 5^ are equipped with uniform probability measure, we 
have 

^^gp(‘ 5^,‘5®) < dGTv(5^,‘5^) < 2/5. 

From the same embedding we see also that the Gromov-Hausdorff distance of 
and S^, conditioned on is bounded by the tree distance of {x 4 ,X 5 } and xi 
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which is twice the time of back coalescence of the dashed recombination line. Thus, 
in this particular case we have 5^) < 2(^2 + • • • + S^). It could of course 

happen that the dashed line coalesces back after the time of the MRCA of S^. In 
this case the upper bound would be 2(5i + • • ■ + S5), where 5i is another independent 
exponentially distributed random variable with mean 1. 

First, we explain a way to compute the Gromov total variation distance explicitly. 

Remark 4.5 (How to compute dcrv)- If the spaces Xi,X 2 are finite with #Xi = 
4 ^X 2 = N and is the uniform distribution on Xi,i = 1,2, then one can compute 
the Gromov total variation distance explicitly. There are isometric embeddings (pi of 
Xi into a common finite metric space {Z, d), i = 1, 2, so that Z can be decomposed 
in three disjoint sets Z = Zi tt) Z2 W ^joint with the property Pi{Xi) = Zi Li ^joint, 
i = 1, 2. The optimal embeddings are the ones for which ^Zjoint is maximal. Using 
such optimal embeddings we have 


dcTvi^, ^[Zi)) + -p.2{p2^ (^ 2 )) 

+ I E = ^ = (4.1) 

As it turns out, to obtain upper bounds on Gromov total variation and therefore 
also the Gromov-Prohorov distances it is helpful to introduce yet another distance 
which is only well-defined on trees from the processes {Ti^°)u&<g- 

Definition 4.6 (Auxiliary distance). Let {7'^)ueG be as in Definition 12.11 

Recall the MRCA within , denoted by •u, from Definition 12.41 For u, u € G, we 
decompose [N] into two subsets: 

1. We denote by the set of all x E [A^] such that the path within from 

X to •u is not hit by a splitting event marked with U E [u,v\. 

2. Then ^ is the set of all x E [A^] with such a splitting event on the path 
from X to •u in ■ 


We define 



(4.2) 


Proposition 4.7 (Properties of the auxuliary distance). Let doTV be as in Defini- 
tion \ 2.1 61. daux, CLS in Definition \4-6i {7'J^)ueG Definition \2.4\ and 

as in Definition \S.l[ 


1. The distance daux is an upper bound for dcTV the sense that, for all u, u E G 


dGTv('4^ < d 


•u,V 

aux 



2 . For u,v,w ^ G with u < v < w, the distances d*aXfi{T^ ^ ) and daulT 

are independent given . 
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Proof. 1. Since the Gromov-Total-Variation distance is defined as the infimum over 
all embeddings into a common mm-space, it suffices to bound the Total-Variation 
distance on a concrete embedding. Therefore, we use Remark 14.51 and write Z = 
1+) Z 2 tt) Zjoint with Zjoint = and Zi, Z 2 are two copies of Distances 

on Zjoint within and are identical by construction and we see from (|4.ip that 

j (rr^ rl ('T~^ 'T~^ \ <r _ J»n 

^GTVy^u •> !v ) — jy ^auxv'u i ’v )‘ 

2. From Definition 14.11 and Remark 021 we see that the triple ,Tj^,T^) can 
be constructed starting with (with a = v,b = w). The resulting graph Q is 
then used to again use the same procedure with initial state {G,TJ^) and use the 
same procedure (with a = v and b = u) this time moving to the left of v. In total, 
this results in marking at rates p{w — v) and p(v — u) independently. Leaves 
which are marked at rate p{w — v) {p{v — u)) until is hit, are elements of 

Importantly, since the marking on [u,u] and [u,?/;] are independent (given 
- see Remark 14.21 - the claim follows. □ 

Remark 4.8 (Bound of dcTV by daux not sharp). It can be easily seen that the strict 
inequality dGTy{%j^,'TJ^) < dluf {Ej^,Ej^) is possible. This happens for instance if 
the dashed line in Figure [3] coalesces immediately with the same line. 

Proposition 4.9 (Bounds on daux)- Let v,w ^ G with v < w and {EJ^)u&g be as 
in Definition \4.1\ For E^, let S 2 , S 3 , ... be the duration for which 2,3,... lines are 
present in the tree. We have 

N 

<p\w-v\Y,Sk. (4.3) 

k=2 

Proof. For x Aq, let £x V Ej^ be the path from x to •^. Its length is given by the 
tree height L := S 2 + ■ ■ ■ + Sn- 

Given EJ^, x € with probability 1 — by Remark 021 2. Hence, 

by exchangeability and the definition of daux, 

E[d:;if (7;'^,r^^)ir,'^] = i - < p\w - v\l. 


4.3 Projective properties of the ARG 

It is well-known that Kingman’s coalescent is projective in the sense that the tree 
spanned by a sample of n leaves from an V-coalescent has the same distribution as 
an n-coalescent. The same holds for the ARG as we will show next. 

Lemma 4.10 (Projectivity in N for the V-ARG). Let G = [a,b], p > 0 and A be 
an N-ARG (with Aq = [A] = {!,... ,N}). Let B C [N] with ffB = n and let ttbA^ 
be the random graph which arises from the particle system starting with particles B 
and following them along A. (Upon coalescence events within A^, followed particles 
merge as well. If a coalescence event within A only involves a single followed particle, 
continue to follow the coalesced particle. Splitting events hitting a followed particle 
are followed as well.) Then, ttbA^ is an n-ARG. 
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Proof. It suffices to consider the particle-counting process of since the fine- 

structure of is exchangeable. Clearly, pairs of particles coalesce at rate 1 and 
every particle splits at the same rate p{h — a). Hence, it coincides with the particle¬ 
counting process of A^ and we are done. □ 

The projectivity of the A^-ARG along the genome is stated next: 

Lemma 4.11 (Projectivity in (0 of the A^-ARG). Let G = [a,b], p> 0, .A^(G) be 
an N-ARG and H = [c, d] C G. Let •n-B_A^ he the random graph which arises from 
the particle system starting with particles [A^] and following them along A. (Upon 
coalescence events within A^, followed particles merge as well. If a coalescence event 
within A only involves a single followed particle, continue to follow the coalesced 
particle. Splitting events hitting a followed partiele are followed as well if the mark 
falls in H.j Then, equals in distribution. 

Proof. It suffices to see that (i) the recombination events at loci in [a, 6] \ [c, d] split 
off ancestral material not in [c, d] and therefore don’t change the genealogical trees 
in and (ii) coalescences with such lines don’t appear in genealogical trees in 

H. Leaving out these recombination events hence leads to .4'^(1H), so the claimed 
equality follows. □ 


5 Proof of Theorem [T] 

The proof of Theorem [1] requires three steps. First, in order to obtain existence of 
limiting processes along subsequences of we have to prove (see Sec¬ 

tion [5T]) 

The family (T'^)jv=i, 2 ,... is tight in T>]M(d). (5.1) 

Second, we show in Section 15.21 

For any limiting process T along a subsequence of /c, 

(T^)7v=i,2,..., (132]) holds. ^ ' 

This equation determines uniquely the finite-dimensional distributions of T since 
polynomials are separating; in particular, since the right hand side of (13.2|) does 
not depend on the subsequence, uniqueness of the limiting process follows. Third, 
bounds on the variation process of T are given in Section 15.31 such that 

The paths of T are a.s. of finite variation with respect 
to Gromov total variation, Gromov-Prohorov and Gromov- (5-3) 

Hausdorff metrics. 


5.1 Tightness in I1 ]m(G) 


In ord er to prove tightness in the s ense (15. ID. we relv on Th eorem 13.6 in [Billingsley 
(I 1999 I I (see also Theorem 3.8.8 in lEthier and Kurt d (jl986l H. i.e. we will show that 
there is C > 0 such that 


limsupE[dGTv(T)l, %^) ■ doTvilf, ‘4^)] < Ch"^. (5.4) 

N^OO 
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Since dcp < ^gtV) this implies tightness with respect to the Gromov-weak topology. 

We assume without loss of generality that the interval [—h, h] is contained in (G. 
Note also that is stationary. We combine Proposition 14.71 and Proposi¬ 

tion 14.91 and write, for Exp(^), 


E[dGTY{F^h, %^) ■ dGTY{%^, O] < 7/)] 

N 2 
k=2 


The last estimate follows from the following elementary computation: 


E 


N 

[(Es. 


2i 


N 


k=2 


J2nsl] + 2 Yl IE[5fc]E[5,] 

k=2 2<k<i<N 

oo oo 2 

Ynsi]+2[Y^iSk]) 


k=2 

CO 

E 

k=2 


k‘^{k — 1)^ 


k=2 


+ 2 


CO 

(E 

1=2 




7T^ 


= 8(y-3) +8< 11. 


Therefore, we have proved (|5.4p . 


5.2 Finite-dimensional distributions 

Let = 1,..., j be (bounded) polynomials and {‘Tj^)ueG be as in Defi¬ 
nition [3T] with ‘X^ = , where = {[N],r{^,p) is as in Definition 12.41 Then, p 

is the uniform distribution on [A^] and gives distances between elements of [A^] 
in . Let 


An,N = y e [A^]” : Xi = Xj} C [A^]” 

l<i<j<n 

be the event that some entry in x G [A^]"' appears twice. Then, we find that 
fj,®"'{An^N) = 0{1/N) (for fixed n) as N ^ oo. Therefore, by construction, set¬ 
ting Xfc, = (xfc, ...,xi) and r{xi,i,x^.i) = {r{xi, Xj))k<ij<i, no = 0, Uj = ni H-hn^. 


2=1 
3 


= lim E 

N^oo 


2 = 1 


= lim E 

N^oo 


= E[0i(^^)...0,(^p]. 


2=1 
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5.3 Finite variation 

In this subsection we prove the last part of Theorem [H namely that the paths of the 
process T = {‘Tu)u£G are of finite variation with respect to Gromov total variation, 
Gromov-Prohorov metric and Gromov-Hausdorff (semi-)metric on M. Recall the 
definition of variation with respect to a metric in (j3.ip . First we show that for 
d € {dcTY, dcp, dcu} that ior u,v & G, u < v there is a positive finite constant 
C = C{p) such that 

E[di%^,%^)]<Civ-u). (5.5) 

Once this is proven for a metric d, for any interval [a, 6] C (G and a partition 
a = uq < ui < ■ ■ ■ < Uk = b of that interval we have by the first part of Theorem [1] 

< limsupE[d(a;^,'rj^_J] < C{ui - m-i). (5.6) 


Then it follows easily 


E 


K 

<C{b-a). 


i=l 


Since the right hand side does not depend on particular partition of [a, 6], this shows 
that the variation of T with respect to d has finite expectation on finite intervals. 
Thus, the paths of T are almost surely of finite variation with respect d. 

For Gromov total variation and Gromov-Prohorov metrics (15.5p follows from 
^GP < ^GTV < f^aux and (14.31) . For the Gromov-Hausdorff metric (15.51) is shown in 
the following lemma. 

Lemma 5.1. There is a positive finite constant C independent of N so that for any 
u,v ^ G, u < V we have 


ndGn{r^,%^)] < Cp{v - u). (5.7) 

Proof. Given the tree as before we denote by S 2 ,... ,Sn the time for which 
exactly 2,... ,N lines are present in the tree (cf. Figure [3|). The random variables 
S 2 ,..., S]sf are independent and is exponentially distributed with mean 1 /( 2 )- 
Along the branches of recombination events occur at rate p{v — u). When a 
recombination event occurs at level /c, that is during the period of time with exactly 
k lines in the tree 7))^, then the resulting extra line coalesces back into the tree 
at some time after the recombination, that is at level for some 1 < (. < k. We 
also need to consider the level ^ = 1 because it might be the case that the extra line 
coalesces back into the tree after all lines of have coalesced with each other. 

Let Si be exponentially distributed with mean 1. Furthermore, for k > 2 and 
1 < I < k let be the event that along the k branches of during time S^, at 
least one recombination event occurs that separates the trees and , and the 
resulting extra line coalesces back into the tree during time Sg. Figure [3] shows 
an example of the event ^ 3 ^ 2 - 
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Then, ignoring the probability of no coalescence during Sk (hence bounding this 
probability from above by 1), 

fc-i 

7n=i+l 
k-1 


K — L 

< p{v - u)kSkiSe 


„—mSm 


.= 11+1 


Note that 


fc-i 


e[ n« 

m=l+l 


—mS„ 


k-1 

n 

m=£+l 


k-2 


_^_= TT_^_ 

l + 2 /(m- 1) J-j- 1 + 2/m 


k-2 k-2 

= exp - E log(l + 2/m)^ < exp ^ 1 /m 

m=i m=i 


m=i 


-“Pv J, k-1- 

Furthermore, given 7^, on the event Ak^(, we have 

N 

rfGH(rj^,r/)<2^s',-. 

j=e 

It follows that for some Cq, Ci,C 2 , C 3 > 0, which don’t depend on N, p, u and v, 

N 

limsupE[dGH(T/^,7/^)] < hmsup E e[2 E Sj ; 

N^oo N^oo /j^t^AT ^ 

£<k<N j=£ 


i<k<N j=i 
N 

< lim sup Co E E®+i'p+.'l 

N^co £<k<N j=e 

<limsupCip(u-u) ^ 

i<k<N j=i { 2 ) ( 2 / ( 2 ) 

< lim sup C'2p(u - n) ^ ^ Cspiv - 


u) 


N^oo 


i<k<N 


which shows the assertion. 


□ 


6 Proof of Theorem [2] 

Theorem [5] claims that correlations between trees % and % decrease with 0{l/v‘^). 
Such correlations come with coalescence times present in % and %. Before we come 
to the proof of Theorem [51 we study such joint coalescences. 
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6.1 Covariances of coalescence times 

Lemma 6.1 (Covariance of distances at n = 0 and u = v). Let $ = Then, 

Proof. From Theorem [H we see that the left hand side of (j6.ip is given as follows: 
Let be an ARC starting with four lines, i?i 2 ,o be the distance of the pair 1,2 at 
u = 0 and i? 34 ,^ the distance of 3,4 at u = v. Then, for all cf has 

the same probability as -Ri 2 ,o = R'iA,v (recall that (/> is a function in the definition of 
polynomials is a function of pairwise distances; see (|2.3p l. 

Hence, the LHS of (|6.ip equals x, where 

X = P(i?l2,0 = -^34,1;) 

y = P(i?i 2,0 = R23,v) 

Z = P(i?i2,0 = Ri2,v) 


The first event in the ARC starting in four lines, can be: 


(i) coalescence of one of the pairs (1,3), (2,3), (1,4), (2,4) 

(ii) coalescence of one of the pairs (1,2), (3,4) 

(hi) Some recombination event. 

In case (hi), the probability x is not changed after the recombination event, in case 
(ii), there is no way that the event Ri 2 ,o = R 34 :,v (hence has probability 0). In case 
(i), however, the probability is the same as in an ARC with three lines, that lines 
1 and 2 at locus 0 coalesce at the same time as lines 2 and 3 at locus v. This 
probability is defined to be y. Similar arguments for a first-event-decomposition in 
the probabilities y and 2 ; lead to 


2 1 

X = -y — -0, 
3^ 3 ’ 


y = - X - 2 ; -I-• 0, 

pv + 3 pv + 3 pv + 3 

_ 2 pv 1 

^ 2pv+ 1^ ^ 2pv+ 1 '^' 


( 6 . 2 ) 


Solving this linear system gives the result. 


□ 


Using the last lemma, we immediately obtain another useful result. 

Corollary 6.2 (Samples of size n > 2). Let n> 2, A^"^ he a 2n-ARG, Rij^u be the 
distance of the pair i,j at position u for u G {0, v}. Then, 


F{Rijfi = Rke,v for some l<i<j<n] n + l<k<i< 2n) 

2 

/ n \ 

< 


9 -b 13pv + 2p^v 


,2^)2 
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6.2 An auxiliary random graph 

For the proof of Theorem [2l we recall the 2n-ARG for loci v E {0,?/}. We let 
Rij^v be the distance between i,j at locus u for u E {0, u}. We set To = and 

Tv = 7^1 "'+^’•••’^"'1 (recall the notation from Definition 12.41) . The following events can 
happen: 

1. “Intra-tree” coalescence events: If in two particles coalesce and both par¬ 
ticles belong to T, then the total number of particles and the number of 
particles in Tv decreases, v E {0,u}. 

2. “Inter-tree” coalescence events: If two particles (in coalesce and one of 
the particles is present in TJi \ To, and the other is present in To \ TJi, then the 
total number of particles decreases but the numbers of particles within To and 
Tu are preserved. 

3. “Splitting recombination” events: If a particle, present in the overlap of the 
trees TL n To, recombines with mark U E [0, rt], then the particle splits in two 
new particles, one present in T(i \ To, the other one present in To \ Tit. 

We call a branch in a single line if it belongs to (To \ T) U [Tu \ To), whereas 
branches in To fl Tit are called double lines. Intra-coalescence occur simultaneously 
within To and Tu if two double lines coalesce. These are the events that make To 
and Tv dependent. We will call such an event joint coalescence. 

We now define a random graph based on such that we can couple To and Tu 
with two independent trees To and Tu, both having the distribution of a Kingman’s 
n-coalescent; see Lemma 16.51 

Definition 6.3 (An auxiliary random graph). Define a random graph from 
which two trees To and Tu can be read off as in Definition 12.41 as follows: Starting 
with 2n single lines, where 1 ,..., n E To\Tu and n-|-l,..., 2n E Tu\To, the dynamics 
of the lines in A^” are as follows (see Figured}): 

(i) Each pair of single lines coalesces at rate 1. The result can be a single line (if 
both lines belong to To or both to Tu) or a double line. 

(ii) Each pair of lines where one is a single line and the other a double line coalesces 
at rate 1. The resulting line is a double line. 

(iii) Each double line splits at rate pu into two single lines. 

(iv) Between each pair of double lines there is a coalescence/splitting event at rate 
2. This event produces a double line and a single line. With probability 1/2 
the resulting single line is in To or in T/, respectively. 

Remark 6.4 (Properties of To and Tu)' Note that the above dynamics in A^” 
are the same as in A^"" except for the coalescence/splitting event described in (iv). 
The corresponding event in A^”' was called joint coalescence above. In particular, we 
remark that we can perfectly couple A^” with A^” until the first coalescence/splitting 
event occurs. 
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Lemma 6.5 (Properties of .4^”'). We note the following properties of : 

1. To and Tu are independent and distributed as n-coalescents. 

2. If we couple and until the first event (iv) happens, and let them evolve 
independently otherwise, then 

{no event (iv) happens} C {To = To} H {Tu = T}- 

3. For n>2, there is C = C{n) > 0 such that 

P(no event (iv) happens) < CKp^u^). 

Proof. 1. Obviously in each pair of lines in To coalesces at rate 1 and also each 
pair of lines in Tu coalesces at rate 1, so that both trees are Kingman’s coalescents, 
and the trees are independent by construction. 

2. Denoting by A the event that at a coalescence/splitting event in A^^ occurs, we 
have A^^ = A^^ on by construction. 

3. By construction, A occurs at rate 2 for every pair of lines within 2joint- For 
non-negative integers a, b and c we indicate by IPabc computations of probabilities 
within A^'^ with start in 

• a single lines within To, 

• b double lines, 

• c single lines within Tu- 
Then, for 


X — P202(4), y — Piii(4), z — Po 2 o(^)) 

we obtain the same set of equations as in (16.21) with the last one replaced by 

.. 2 ^ 

2pu + 2^ 2pu + 2 

Solving this system gives 


2 

^ 9-1- 7pu + p'^u^ ’ 

which shows the assertion for n = 2. As in Corollary 16.21 we obtain for all n = 
2,3,... 


P(some event (iv) happens) < 


ny 2 

2/ 9 + 7pu + p'^u'^ 


which concludes the proof. 


□ 
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(iii) 


(i) 


(iv) 


(ii) 


Figure 4: Reading off trees at different loci starting with disjoint sets of leaves from 
modified ARG with n = 3. Some of the events are annotated according to 
the description. The dashed ellipsis encloses the event which is not possible in the 
original ARG, cf. Figure EJ 

6.3 Proof of Theorem [2] 

Let T = and According to Theorem [U we need to consider a 2n- 

ARG and let Rij^v be the distance between i,j at locus u for u G {0, u}. Writing 
^ := {Rijfi)i<i,j<n, ■= {Rij,v)n+I<i,j<2n, Theorem □ gives 

COV[T('ro), <^{%)] = COV[V^(4), mj. (6.3) 

Let To, be as in Lemma [631 which are coupled with To, Tu before the first coales¬ 
cence/splitting event happens. Let R^ and R^ be the (hnite) distance matrices that 

correspond to To and Tu- Slightly abusing the notation we write 

ijjQ = 4>u = <P{EJ, '00 = 0(Jo) and 

Denoting by A the event that a coalescence/splitting event in occurs, we have 
using Lemma 16.51 

E[0O0m] = IE[V>O0«lyi0 + E[V)o0m1a] 

= E[^o0«1a0 + E[V)o0«1a] 

= E[^O0«] - E['0o0«1a] + E[V'o0u1a] 

= E[^o]1E[0„] - E[^o0«1a] + E[V'o0«1a] 

= E[0o]E[())„] - E[0o0«1a] + E[0o0«1a]- 

R follows now that for C = 2P(A), we have 

|E[V^o00 - E[^o]E[</.0| < CIlV^llooll^lloo, 
which, in view of Lemma 16.51 3. shows the assertion of Theorem [2] . 
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